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C.-Y. Chan, P. Felber, M. Garofalakis, R. Rastogi 

December The VLDB Journal — The International Journal on Very Large Data 
2002 Bases, Volume 1 1 Issue 4 

Publisher: Springer-Verlag New York, Inc. 

Full text available: ^Pcjf (383.34 Additional Information: full citation , abstract , references , cited by , 

KB) IMexjerms 
Bibliometrics: Downloads (6 Weeks): 6, Downloads (12 Months): 100, Citation Count: 16 

The publish/subscribe paradigm is a popular model for allowing publishers (i.e., data 
generators) to selectively disseminate data to a large number of widely dispersed 
subscribers (i.e., data consumers) who have registered their interest in specific ... 
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Bibliometrics: Downloads (6 Weeks): 13, Downloads (12 Months): 143, Citation Count: 25 

Several alternatives to manage large XML document collections exist, ranging from 
file systems over relational or other database systems to specifically tailored XML 
base management systems. In this paper we give a tour of Natix, a database 
management ... 
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3 Query op timization for OLAP-XML federations 
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November DOLAP '02: Proceedings of the 5th ACM international workshop on Data 
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Publisher: ACM 

Full text available: gPdf (205.89 Additional Information: full citation , absna c i, < ^ ; ■ . ^ , --suv <x , 

KB) index terms 

Bibliometrics: Downloads (6 Weeks): 10, Downloads (12 Months): 83, Citation Count: 3 

The changing data requirements of today's dynamic business environments are not 
handled well by current OLAP systems. Physically integrating unexpected data into 
such systems is a long and time-consuming process making logical integration, i.e., 
federation, ... 
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April WWW '01 : Proceedings of the 10th international conference on World Wide 
2001 Web 
Publisher: ACM 
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Bibliometrics: Downloads (6 Weeks): 30, Downloads (12 Months): 226, Citation Count: 202 

Schema matching is a basic problem in many database application domains, such as 
data integration, E-business, data warehousing, and semantic query processing. In 
current implementations, schema matching is typically performed manually, which 
has significant ... 

Keywords: Graph matching, Machine learning, Model management, Schema 
integration, Schema matching 



6 Managing security p olicies in a distributed environment using extensible markup 
language (XML) 

Nathan N. Vuong, Geoffrey S. Smith, Yi Deng 

March SAC '01 : Proceedings of the 2001 ACM symposium on Applied computing 
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Publisher: ACM 
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Bibliometrics: Downloads (6 Weeks): 6, Downloads (12 Months): 68, Citation Count: 4 

Keywords: Java, RBAC, XML, distributed authorization, managing security policies, 
meta-language 
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November The VLDB Journal — The International Journal on Very Large Data 
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Full text available: 'Qpdf (241 .60 Additional Information: full citation , abstract , references , cited by, 
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Bibliometrics: Downloads (6 Weeks): 7, Downloads (12 Months): 49, Citation Count: 5 

We are interested in defining and querying views in a huge and highly heterogeneous 
XML repository (Web scale). In this context, view definitions are very large, involving 
lots of sources, and there is no apparent limitation to their size. This raises ... 
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Damien K. Fisher, Franky Lam, William M. Shui, Raymond K. Wong 
^ November CI KM '03: Proceedings of the twelfth international conference on 
2003 Information and knowledge management 

Publisher: ACM 

Full text available: gpdf (127.44 Additional Information: h.'i , l\ - \ ,vs, u t v-vw, v : ^ ^ , 

KB) index terms 

Bibliometrics: Downloads (6 Weeks): 3, Downloads (12 Months): 76, Citation Count: 2 

With the increasing popularity of XML, there arises the need for managing and 
querying information in this form. Several query languages, such as XQuery, have 
been proposed which return their results in document order. However, most recent 
efforts focused ... 

Keywords: XML, dynamic, order maintenance, semi-structured data 
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August Proceedings of the 19th international conference on Computational 
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Publisher: Association for Computational Linguistics 

Full text available: Qrvi (160.03 Additional Information: full citation , abstract , rou v . 

KB) 

Bibliometrics: Downloads (6 Weeks): 0, Downloads (12 Months): 10, Citation Count: 0 

This paper investigates methods to automatically infer structural information from 
large XML documents. Using XML as a reference format, we approach the schema 
generation problem by application of inductive inference theory. In doing so, we 
review and ... 
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jik Petra Saskia Bayerl, Harald Lungen, Daniela Goecke, Andreas Witt, Daniel Naber 

November DocEng '03: Proceedings of the 2003 ACM symposium on Document 
2003 engineering 
Publisher: ACM 

Full text available: QpcJi (230.61 Additional Information: \ , «i v . abstract, refe-H «\ * ^ 
KB) 

Bibliometrics: Downloads (6 Weeks): 4, Downloads (12 Months): 52, Citation Count: 1 

We present an approach on how to investigate what kind of semantic information is 
regularly associated with the structural markup of scientific articles. This approach 
addresses the need for an explicit formal description of the semantics of text- 
oriented ... 
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September SI Gl R '01 : Proceedings of the 24th annual international ACM SIGIR 
2001 conference on Research and development in information retrieval 

Publisher: ACM 

Full text available: gpdf (234.90 Additional Information: tun , ,t \: \ ,v , u i * - ^ , a ^ , 

KB) indjWLtefiQS 
Bibliometrics: Downloads (6 Weeks): 10, Downloads (12 Months): 84, Citation Count: 49 

Based on the document-centric view of XML, we present the query language XI RQL. 
Current proposals for XML query languages lack most IR-related features, which are 
weighting and ranking, relevance-oriented search, datatypes with vague predicates, 
and ... 



12 Constructing a virtual primary key for fingerprinting relation,- , av 

Yingjiu Li, Vipin Swarup, Sushil Jajodia 
V October DRM '03: Proceedings of the 3rd ACM workshop on Digital rights 
2003 management 
Publisher: ACM 

Full text available: "Qpi? (240.49 Additional Information: full citation , abstract , refer e ic 2§, ted >\ , 

KB) index terms 

Bibliometrics: Downloads (6 Weeks): 5, Downloads (12 Months): 79, Citation Count: 6 

Agrawal and Kiernan's watermarking technique for database relations [1] and Li et 
al's fingerprinting extension [6] both depend critically on primary key attributes. 
Hence, those techniques cannot embed marks in database relations without primary 
key ... 

Keywords: fingerprinting, primary keys, relational databases, robustness 

13 Qu i - x c 1 an information economy 

^ R. Braumandl, A. Kemper, D. Kossmann 

W November ACM Transactions on I nternet Technology ( TO I T) , Volume 3 Issue 
2003 4 
Publisher: ACM 

Full text available: 'Qpol (829.15 Additional Information: fulj citation, abstract, reference?, N ' v i>\ , 

KB) index terms 

Bibliometrics: Downloads (6 Weeks): 11, Downloads (12 Months): 162, Citation Count: 6 

Accessing and processing distributed data sources have become important factors for 
businesses today. This is especially true for the emerging virtual enterprises with 
their data and processing capabilities spread across the Internet. Unfortunately, ... 

Keywords: Quality of Service 
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September VLDB '2003: Proceedings of the 29th international conference on 
2003 Very large data bases - Volume 29, Volume 29 

Publisher: VLDB Endowment 

Full text available: gpdf (129.35 Additional Information: n««i . !* *\ „vs, ut n.^w*, o : » » ^ , 

KB) indjWLtefiQS 
Bibliometrics: Downloads (6 Weeks): 5, Downloads (12 Months): 42, Citation Count: 1 

Querying large numbers of data sources is gaining importance due to increasing 
numbers of independent data providers. One of the key challenges is executing 
queries on all relevant information sources in a scalable fashion and retrieving fresh 
results. ... 



Evaggelia Pitoura, Serge Abiteboul, Dieter Pfoser, George Samaras, Michalis Vazirgiannis 

September ACM SI GMOD Record, Volume 32 Issue 3 

2003 

Publisher: ACM 

Full text available: ■gpof (282.99 Additional Information: full citation. abM , , 
KB) 

Bibliometrics: Downloads (6 Weeks): 7, Downloads (12 Months): 57, Citation Count: 4 

The challenge of peer-to-peer computing goes beyond simple file sharing. In the 
DBGIobe project, we view the multitude of peers carrying data and services as a 
superdatabase. Our goal is to develop a data management system for modeling, 
indexing and ... 

Keywords: global computing, metadata, peer-to-peer computing, peer-to-peer 
databases, pervasive computing, services, ubiquitous computing 
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Berthold Crysmann, Anette Frank, Bernd Kiefer, Stefan Muller, Gunter Neumann, Jakub 
Piskorski, Ulrich Schafer, Melanie Siegel, Hans Uszkoreit, Feiyu Xu, Markus Becker, Hans- 
Ulrich Krieger 

July ACL '02: Proceedings of the 40th Annual Meeting on Association for 

2001 Computational Linguistics 

Publisher: Association for Computational Linguistics 

Full text available: QpcJi (120.20 Additional Information , ^wv^\ \o \ 
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Bibliometrics: Downloads (6 Weeks): 1 , Downloads (12 Months): 26, Citation Count: 6 
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We present an architecture for the integration of shallow and deep NLP components 
which is aimed at flexible combination of different language technologies for a range 
of practical current and future applications. In particular, we describe the 
integration ... 
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November The VLDB Journal — The International Journal on Very Large Data 
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Bibliometrics: Downloads (6 Weeks): 24, Downloads (12 Months): 31 1 , Citation Count: 1 8 

On the Semantic Web, data will inevitably come from many different ontologies, and 
information processing across ontologies is not possible without knowing the 
semantic mappings between them. Manually finding such mappings is tedious, error- 
prone, and ... 



Keywords: Machine learning, Ontology matching, Relaxation labeling, Semantic 
Web 
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May WWW '03: Proceedings of the 12th international conference on World Wide 
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Publisher: ACM 

Full text available: Qpdf (237.35 Additional Information: full citation, abstract, references, sled by.. 

KB) index terms 

Bibliometrics: Downloads (6 Weeks): 3, Downloads (12 Months): 71, Citation Count: 14 

This paper describes a novel approach for obtaining semantic interoperability among 
data sources in a bottom-up, semi-automatic manner without relying on pre-existing, 
global semantic models. We assume that large amounts of data exist that have 
been ... 
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V" June ACM SI GMOD Record, Volume 30 Issue 2 
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Publisher: ACM 

Full text available: gpdf (331 .70 Additional Information: lull c it y on, abstract, ret yences, - ft 5d bj , 

KB) LQdex.terms 
Bibliometrics: Downloads (6 Weeks): 24, Downloads (12 Months): 220, Citation Count: 25 

In this paper we present a method for automatically segmenting unformatted text 
records into structured elements. Several useful data sources today are human- 
generated as continuous text whereas convenient usage requires the data to be 
organized as structured ... 
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Stephen Dill, Nadav Eiron, David Gibson, Daniel Gruhl, R. Guha, Anant Jhingran, Tapas 

Kanungo, Sridhar Rajagopalan, Andrew Tomkins, John A. Tomlin, Jason Y. Zien 

May WWW '03: Proceedings of the 12th international conference on World Wide 
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Publisher: ACM 

Full text available ^Pdl (178 36 Additional Information * , - ^^v'-s^b 

KB) index terms 

Bibliometrics: Downloads (6 Weeks): 23, Downloads (12 Months): 273, Citation Count: 43 

This paper describes Seeker, a platform for large-scale text analytics, and SemTag, 
an application written on the platform to perform automated semantic tagging of 
large corpora. We apply SemTag to a collection of approximately 264 million web 
pages, ... 

Keywords: automated semantic tagging, data mining, information retrieval, large 
text datasets, text analytics 
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